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Abstract. It is known that families of graphs with a semialgebraic edge re¬ 
lation of bounded complexity satisfy much stronger regularity properties than 
arbitrary graphs, and that they can be decomposed into very homogeneous 
semialgebraic pieces up to a small error (e.g., see |33l 1161118p . We show 

that similar results can be obtained for families of graphs with the edge re¬ 
lation uniformly definable in a structure satisfying a certain model theoretic 
property called distality, with respect to a large class of generically stable 
measures. Moreover, distality characterizes these strong regularity properties. 
This applies in particular to graphs definable in arbitrary o-minimal structures 
and in p-adics. 


1. Introduction 

In this paper by a graph we always mean an undirected graph, i.e. a graph 
G = (U, E) consists of a set of vertices V together with a symmetric set of edges 
ECV xV. 

As usual we say that a subset Vq C V is homogeneous if either {v,v') G E for all 
V ^ v' gVq or {v, v') ^ E for all v ^ v' gVq, i.e. the induced graph on Vq is either 
complete or empty (we ignore the diagonal). 

A classical theorem of Erdos-Szekeres m states that every graph on n vertices 
contains a homogeneous subset of size at least ^logn (all log’s are of base two), 
and this bound is tight up to a constant multiple. 

Since the families of graphs with a forbidden induced subgraph have much 
stronger structural properties than arbitrary graphs, they have much bigger ho¬ 
mogeneous subsets. 

Theorem 1.1 (Erdos-Hajnal, [13)1. For any finite graph H there is a constant c = 
c{F[) > 0 such that every El-free finite graph on n vertices contains a homogeneous 
subset of size at least 

However the following conjecture is widely open (see e.g. [51 [TU]~). 

Erdos-Hajnal Conjecture. For every finite graph El there is a constant <5 = 
5{H) > 0 such that every il-free graph on n vertices has a homogeneous subset of 
size at least . 

In the bi-partite case one has better bounds. Let G = (V, E) be a graph. We 
say that a pair of subsets Vi, V 2 C V is homogeneous if either Vi x V 2 G E or 
(Vi X U2) HE = 0. 
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Theorem 1.2 (Erdos, Hajnal and Each [2]). For any finite graph H there is a con¬ 
stant S = S{F[) > 0 such that every H-free graph on n vertices has a homogeneous 
pair Vi,V 2 with \Vi\, IV 2 I > . 

The following definition is taken from m- 

Definition 1.3. Let Q he a class of finite graphs 

(1) Q has the Erdos-Hajnal Property if there is 5 > 0 such that every G = (V, G) G 
Q has a homogeneous subset Vq of size |Vb| > 11^1^ 

(2) Q has the strong Erdos-Hajnal Property if there is <5 > 0 such that every G G Q 
has a homogeneous pair Vi, V 2 with |Vi|, IV 2 I > 5|E|. 

Remark 1.4. Is is shown in [2] that if a family of finite graphs Q has the strong 
Erdos-Hajnal property and is closed under taking induced subgraphs then it has 
the Erdos-Hajnal property. 

In this paper we consider families of graphs whose edge relations are given by a 
fixed definable relation in a first-order structure. 

Definition 1.5. Let A4 be a first-order structure and R C x be a definable 
relation. Consider the family Qn of all finite graphs V = (G, E) where G C is a 
finite subset and E = (V x V)nR. We say that R satisfies the (strong) Erdos-Hajnal 
property if the family Qr does. 

We extend this notion to the bi-partite case. 

Definition 1.6. Let At be a first-order structure and R C x M" a definable 
relation. 

(1) A pair of subsets A C M™, B C M" is called R-homogeneous if either Ax B C 
i? or (A X H) n i? = 0. 

(2) We say that the relation R satisfies the strong Erdos-Hajnal property if there 
is a constant 6 = 6{R) > 0 such that for any finite subsets A C M™, B C M" 
there are Aq Q A, Bq Q B with |Ao| > <5|A|, |i?o| > S\B\, and the pair Ao,Bo 
is i?-homogeneous. 

Our motivation for this work comes from the following remarkable theorem by 
Alon et al. 

Theorem 1.7 (PJ Theorem 1.1]). If R E M" x R’" is a semialgebraic relation then 
R has the strong Erdos-Hajnal property. 

Remark 1.8. (i) Although it is not stated explicitly in [2], but can be easily 

derived from the proof, homogeneous pairs in the above theorem can be 
chosen to be relatively uniformly definable. 

(ii) The above theorem was generalized by Basu (see [5]) to (topologically closed) 
relations definable in arbitrary o-minimal expansions of real closed fields. 

Besides the Erdos-Hajnal property for semialgebraic graphs, the above theorem 
has many other applications including unit distance problems [32] , improved bounds 
in higher dimensional semialgebraic Ramsey theorem m, m Theorem 1.2], algo¬ 
rithmic property testing m, and can also be used to obtain a strong Szemeredi-type 
regularity lemma for semialgebraic graphs [igiiH] (see also Section [5|). 

The aim of this article is to demonstrate that the above result by Alon et ah, 
along with its key implications, holds at a much larger level of generality, namely for 
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families of graphs whose edge relation is definable in a structure satisfying a certain 
model theoretic property called distality (see Section [2^ and with respect to the 
class of the so-called generically stable measures (as opposed to just the counting 
ones, see Section 12.41) . In particular, this applies to graphs definable in arbitrary 
o-minimal structures and in p-adics with analytic expansions, with respect to the 
Lebesgue (respectively, Haar) measure on a compact interval (respectively, compact 
ball). 

The following is one of the key results of our paper (see Theorem 13.II) . 

Theorem 1.9. Let Ai be a distal structure, and R C a definable relation. 

Then there is a constant S = S{R) > 0 such that for any generically stable measures 
/ii,/i 2 on M” and respectively there are definable sets A C M", B C with 
h'liA) > S, P 2 {B) > 5, and the pair A, B is R-homogeneous. 

Remark 1.10. It is not hard to see that our Theorem 11.91 implies Theorem 11.71 
by taking J\4 to be the ordered field of real numbers, and considering measures 
concentrated on finite sets. Thus distal structures provide a natural framework for 
a model theoretic approach to Ramsey-type results in geometric combinatorics. 

Remark 1.11. It is demonstrated by Malliaris-Shelah in (see also [S] for an 
alternative proof) that if is a stable structure and R C x is a definable 
relation then the family of finite graphs Or has the Erdds-Hajnal property. How¬ 
ever in general this family does not have the strong Erdds-Hajnal property (see 
Section EH]). 

Remark 1.12. Our proof of Theorem II.91 (and the density version in Corollary 14.61) 
gives explicit bounds on 5 and the number of the parameters in the definitions 
of A and B in terms of the VC-density of the edge relation R. In particular, for 
o-minimal structures and for p-adics, we obtain a bound in terms of the number 
of variables involved in R, due to the corresponding bounds for VC-density from 
[Ij. We were informed by Pierre Simon that after reading our paper he had found 
another proof of this result which is faster, but does not give bounds. 

A brief summary of the paper. In Section [2] we introduce the context and the 
notation: first-order structures and definable sets, distality, Keisler measures and 
generic stability. In SectionEjwe prove a definable generalization of Theorem 1 1. 71 for 
bi-partite graphs. In Section |4| we improve it to a density version, using which we 
obtain an analogue for hypergraphs and a version allowing additional parameters in 
the definition of the edge relation. This gives in particular a lot of new families of 
graphs satisfying the strong Erdds-Hajnal property (see ExamDle l4.11|) . In SectionEI 
we obtain a strong regularity lemma for hypergraphs definable in distal structures, 
generalizing the result for semialgebraic hypergraphs from [mill]. 

In Section El we consider the converse to our results from the previous sections. 
First, in Section [6.II we demonstrate a very explicit failure of the definable counter¬ 
part of Theorem 11.71 in the theory of algebraically closed fields of positive charac¬ 
teristic, even without requiring definability of the homogeneous subsets. It follows 
in particular that every field interpretable in a distal structure is of characteristic 
0. In Section 16.21 we prove that distality of a structure is in fact equivalent to the 
definable counterpart of Theorem 11.71 

Some further questions concerning incidence phenomena and higher-dimensional 
Ramsey theory in our setting will be addressed in a future paper. 
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2. Preliminaries 

2.1. Model theoretic notation. We assume familiarity only with the very basic 
notions of model theory such as first-order structures and formulas that can be 
found in any introductory model theory book (e.g. [30)1. By a structure we always 
mean a first-order structure. 

Our notations are standard. We will denote first-order structures by script letters 
etc., and use letter M,N etc. to denote their underlying sets. Very often 
we will not distinguish singletons and tuples: e.g. we may use x to denote a tuple 
of variables (a;i, ..., x„), use a to denote an element of M", and then we use |x| to 
denote the length of the tuple x = (xi,..., x„). 

If At is a structure and (j>{x,y) is a formula in the language of A4, then for 
a G as usual, by f/>(M, a) we will denote the subset of defined by (j){x, a), 

namely a) = {b G : A4 \= a)}. 

A subset X C M” is called definable if there is a formula (j)[x,y) and a G 
such that X = a); if we want to specify the set of parameters, then for A C M 

a definable subset X C M" is called A-definable (or definable over A) if we can 
choose a as above in Also if we want to specify (j) we say that such a set X is 
(((-definable. 

2.2. VC-dimension and NIP. Vapnik-Chervonenkis dimension, or VC-dimension, 
is an important notion in combinatorics and statistical learning theory. Let X be 
a set, finite or infinite, and let A be a family of subsets of X. Given A C A, we 
say that it is shattered by X if for every A' C A there is some S G T such that 
A n 5* = A'. A family X is said be a VC-class if there is some n < oj such that no 
subset of X of size n is shattered by T. In this case the VC-dimension of X, that 
we will denote by VC'(J'), is the smallest integer n such that no subset of X of size 
n -I- 1 is shattered by T. For a set R C A, let A fl R = {A fl R : A G A} and let 
TTjr (n) = max{|A fl R| : R C A, |R| = n}. 

Fact 2.1 (Sauer-Shelah lemma). IfVC{J-) < d then for n > d we have {n) < 

If S' C A is a subset and xi,..., x^ G A, we let Av(xi,..., x„; S) = ;i|{i < n : 
Xi G S}| (we don’t assume that the points Xi, ..., x„ are distinct). 

Fact 2.2 (VG-theorem [43], see also |2T1 Section 4] for a discussion). For any 
k > 0 and e > 0 there is n = 0(fc(i)^logi) satisfying the following. For any 
finite probability space (A, y) and a family T of subsets of X of VC-dimension 
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< k, there are some xi,... G X such that for any S € tF we have |^(5') — 
Av{xi,... ,Xn', S')! < e. 

An important class of NIP theories was introduced by Shelah in his work on the 
classification program m- It has attracted a lot of attention recently, both from 
the point of view of pure model theory and due to some applications in algebra and 
geometry. We refer to min] for an introduction to the area. 

As was observed early on in [25] , the original definition of NIP is equivalent to 
the following one (see |4] for a more detailed account). 

Definition 2.3. Let T be a complete theory and (j>{x, y) a formula in T, where x, y 
are tuples of variables, possibly of different length. We say that the formula (j){x, y) 
is NIP if there is a model Ad of T such that the family of sets {(j>{M, a) : a G 
is a VC-class. In this case we define the VC-dimension of (f){x,y) to be the VC- 
dimension of this class. (It is easy to see that by elementarily equivalence the above 
does not depend on the model M of T.) 

A theory T is NIP if all formulas in T are NIP. 

Slightly abusing terminology we say that a structure M is NIP if its complete 
theory Th(Ad) is NIP. Restated differently, a structure A4 is an NIP structure if 
for every formula (j){x,y) the family of (()-definable sets = {(j){M,a) : a € 
is a VC-class. 

Given a set of formulas A(x, y) and a set B C we say that 7r(x) is a A-type 
over B if 7r(x) C y)gA fees ^)} there is some M P Ai and 

some a G satisfying simultaneously all formulas from 7r(x). By a complete 
A-type over B we mean a maximal A-type over B. We will denote by Sa{B) 
the collection of all complete A-types over B. In view of the remarks above, the 
following is an immediate corollary of the Sauer-Shelah lemma. 

Fact 2.4. A structure M is NIP if and only if for any finite set of formulas A(x, y) 
there is some d G N such that |S'a(I?)| = 0(|i?|'^) for any finite B C 


2.3. Distality. The class of distal theories is defined and studied in [38], with the 
aim to isolate the class of purely unstable NIP theories (as opposed to the class 
of stable theories which are always NIP, see also [39]). The original definition is 
in terms of some properties of indiscernible sequences, but the following explicit 
combinatorial characterization of distality given in [8] can be used as an alternative 
definition. 

Fact 2.5. Let T be a complete NIP theory and Ai a model ofT. The following are 
equivalent. 

(1) T is distal (in the sense of the original definition, see Fact f^ - 

(2) For every formula (^(x, y) there is a formula tpix, yi,..., yn) with \yi\ = • • • = 

\yn\ = |y| such that: for any finite B C with \B\ > 2 and any a G 

there are 6i,..., G B such that A4 ^ 'f’ia, bi,..., 6„) and ifix, bi,..., 6„) h 
tp^{a/B) (i.e. for any b G B either (({M, b) V ipiM, bi,..., 6„) or h) fl 

i/-(M,6i,...,6„) = 0;. 

Remark 2.6. It is not hard to see that if Ai satisfies Fact 12.51 21 for all formulas 
(j>{x, y) with |x| = 1 then it satisfies it for all formulas, i.e. Ai is distal. Besides, any 
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A4 satisfying 12.5r 2') is automatically NIP (easy to see using the equivalence from 
Fact 12.4L so the assumption that A4 is NIP in Fact 12.51 is used to deduce (2) from 
(!)• 

Remark 2.7. An immediate corollary of Fact 12.51 21 is that in a distal structure, for 
any formula 4>{x, y) there is a formula ip'iVi 2/i) • ■ •) Vn) such that for any finite B C 
MIJ'I with \B\ > 2 and a € there are some 6i,..., S i? such that (j){a, B) = 
'ip'{B,bi,... ,bn). Namely, one can take V''(2/, 2/i, ■ • ■, 2/n) = Va;(V’(ai, yi,..., i/n) -t 
(j){x,y)). In fact, this corollary characterizes NIP (see [5] for the details). 

We list some examples of distal structures (providing more details than we nor¬ 
mally would, for the sake of a non model-theorist reader). 

2.3.1. 0-minimal structures. A structure M = (M, <,...) is o-minimal if every 

definable subset of M is a finite union of singletons and intervals (with endpoints 
in M U {±oo}). From this assumption one obtains cell decomposition for definable 
subsets of M", for all n. Moreover, a cell decomposition of a definable set is 
uniformly definable in terms of its definition (see |41) for a detailed treatment 
of o-minimality, or [361 Section 3] and references there for a quick introduction). 
Examples of o-minimal structures include R = (R,-f, x), Rexp = (R,+, x,e^), 
Ran = ^)/ ([0 1 ]'=) / ranging over all functions real-analytic on some 

neighborhood of [0,1]^, or the combination of both Ran.exp- It is straightforward 
to verify that if Ai is o-minimal then it satisfies Fact 12.51 21 for all formulas (j){x, y) 
with |a;| = I. 

Example 2.8. The field of reals R. 

By Tarski’s quantifier elimination, for each n the definable subsets of R" are 
exactly the semialgebraic sets, namely finite Boolean combinations of sets defined 
by polynomial equations p{xi,... ,Xn) = 0 and inequalities q{xi,.. .Xn) > 0 for 
p{x),q{x) S R[a:i,..., x„]. It is o-minimal, and so distal. 

2.3.2. Ordered dp-minimal struetures. More generally, it is proved in [38] that ev¬ 
ery ordered dp-minimal structure is distal (see Fact EH). Examples of ordered 
dp-minimal structures include weakly o-minimal structures and quasi-o-minimal 
structures. An ordered structure M. is weakly o-minimal (quasi-o-minimal) if in 
every elementary extension, every definable subset is a finite union of convex subsets 
(respectively, a finite boolean combination of singletons, intervals and 0-definable 
sets E]). 

Example 2.9. Z as an ordered group. 

By Presburger’s quantifier elimination, for each n the definable subsets of Z” are 
finite boolean combinations of sets of the following types: 

{ {xi , . . . , Xn^ € R . QiXi -f • * * -f ajiXji — Uq} ) 

{ (^Xi , . . . , Xtt.) € R . a^Xi -f * * * -f ^ , 

'S'a = {(a^D ... ,a:„) G R" : 3y e Zfcy = oo + aiXi H-h a„x„}, 

for d = (oq, oi,..., Un) G Z"+^ and fc G N. This structure is quasi-o-minimal (see 
El Example 2]). 
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Example 2.10. The valued field K = UnsN of Puiseux power series over 

K, in the language Ldiv = {0,1, <, +, —, x,v{x) < v(y)}. 

Using quantifier elimination from |12] . for each n the definable subsets o/K” (in 
the language of valued fields) are finite boolean combinations of sets of the following 
types: 

Sp = {(a^i,... ,a;„) G K”: p(a;i,. ..,Xn)= 0}, 

Sp = {{xi,...,Xn) G K": p{xi,...,Xn) > 0}, 

^p,q = G K”: v{p{xi ,... ,a;„)) > v{q{xi ,... ,x„))}, 

for p,q € K[a;i,..., Xn] and k gN. 

This structure is a model of the complete theory RCVF of real closed fields 
equipped with a proper convex valuation ring, and by [T^ it is weakly o-minimal. 

2.3.3. P-minimal structures with definable Skolem functions. 

Example 2.11. By a result of MacIntyre [28] the field of p-adics Qp eliminates 
quantifiers in the language Lp = {0,1,+, x,v{x) < v{y),Pn{x)}, where for n > 2 
we have Pn{x) <++> 3y{x = y^). It follows that for each n the definable subsets of 
Qp are finite boolean combinations of sets of the following three types: 

Sp = {{xi ,... ,Xn) G M”: p{xi,... ,Xp) = 0}, 

Sp,q = {(a;i,... ,ai„) G K”: v{p{xi,.. .,Xn)) > v{q{xi ,... ,x„))}, 

Sp = {{xi ,... ,x„) G K”: 3?/ G Qp = p{xi,.. .,Xn), 
for p, q G Qp[xi,..., Xn] and fc G N. 

Similarly to the o-minimal case, there is a notion of minimality for expansions 
of Qp. Namely, a structure in a language L Q Lp is p-minimal if in every model 
of Th(M), every definable subset in one variable is quantifier-free definable just 
using the language Lp |20j . P-minimal structures with an additional assumption of 
definability of Skolem functions satisfy an analogue of the p-adic cell decomposition 
of Denef. A motivating example of a p-minimal theory with definable Skolem 
functions is the theory pCFan of the field of p-adic numbers Qp expanded by all 
sub-analytic subsets of Zp [42] . 

By m Corollary 7.8], every p-minimal structure with definable Skolem func¬ 
tions is dp-minimal. Since (Qp,+, x,0,l) is distal [38] (can be also verified using 
Fact l6.6ll . it follows by R emark lb.7l that any p-minimal theory with Skolem functions 
is distal. 

2.4. Keisler Measures. Let AI be a structure. 

Recall that a Keisler measure on M" is a finitely additive probability measure on 
the Boolean algebra of all definable subsets of M”, i.e. it is a function p, that assigns 
to every definable X C M” a number p(A) G [0,1] with p(0) = 0, p(M") = 1 and 

p(A u y) = p(A) + p(y) - p(A n y) 

for all definable subsets X,Y C M”. Given a formula fiix) with parameters from 
M and a Keisler measure p on we will write fj,{cj){x)) to denote p((()(Ml“l)). 

In this paper we will deal mostly with the so-called generically stable and smooth 
Keisler measures. 

Generically stable measures on M" are defined as Keisler measures on M" ad¬ 
mitting a (unique) global M-invariant extension which is both finitely satisfiable 
in M and definable over M (see Remark ] 5. 1211 . In the NIP case, according to the 
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following fact from [22l Theorem 3.2] (see also [39l Section 7.5]), they can also be 
defined in terms of the structure j\4 alone without mentioning global measures. 

Fact 2.12. Let A4 be an NIP structure and /i a Keisler measure on . Then the 
following are equivalent. 

(1) The measure fi is generically stable. 

(2) For every formula 4>{x, y) with |a;| = k and £ > 0 there are some Oi,..., S 

such that \iJ.{(j){x, b)) — Av{ai ,..., Om', 6)) | < £ for any b G 

The VC-theorem implies that in NIP theories, for any Keisler measure, a uni¬ 
formly definable family of sets admits an £-approximation by types (see |211 Section 
4]). Fact 12.121 21 implies that with respect to a generically stable measure, there 
are £-approximations by elements of a model rather than just by types over it. 
We remark that the bound on the size of £-approximations depends just on the 
VC-dimension of the formula (so uniform over all generically stable measures). 

Proposition 2.13. Let M be an NIP structure. Then for any k G uj and any 
£ > 0 there is some n = 0{k{^)^ log |) such that: for any formula (j){x,y) of VC- 
dimension at most k and any generically stable measure y on there are some 

oi,..., a„ G such that for any b G \fj,{cj){x, b)) — Av{ai ,..., a„; ((>(x, b))\ < 

£. 

Proof. Let n G lo he given for k and e by Fact 12.21 Let (j){x, y) be a formula of VC- 
dimension at most k and y. an arbitrary generically stable measure on By 

Fact I 2 .I 2 I 2 I there are some a'^,..., a(„ G such that for any b G Mis'! we have 
\y{(j){x,b)) — iy{(j){x,b))\ < £/2, where iy{(j){x,b)) = AY{a[,..., a'^^; (j){x,b)). Now 
applying Fact 12.21 to i/, we find some Oi,..., a„ G such that for all b G M, 

\v{4){x, b)) — Av(ai,..., a„; (j){x, &))| < e/2. Then 

luicfix, b)) - Av(ai, ...,an\(j>{x, 6))| < £ 

for all b G M\y\, as wanted. □ 

Remark 2.14. Encoding several formulas into one we can replace a single formula 
(l){x,y) in Proposition 12. 1 31 bv a finite set of formulas A{x,y). 

Recall that a Keisler measure y on M" is called smooth if there is a unique 
global Keisler measure extending it. The following equivalence can be used to 
avoid a reference to global measures in the definition of smoothness. 

Fact 2.15 f [22L Section 2]). A Keisler measure y on M" is smooth if and only if 
the following holds. 

For any formula 4>(x, y) with \x\ = n and £ > 0 there are some formulas 9} (x), Of {x) 
and tpiiy) with parameters from M, for i = 1,..., m, such that: 

(1) the sets partition 

(2) for all i and b G if Ai \= tpiib), then 

h />(x,b))&e(<i>(x,b) 9/(x)), 

(3) for each i, y{6/{x)) — y{9}{x)) < e. 

Every smooth measure is generically stable, and there are generically stable 
measures which are not smooth (though every Keisler measure in an NIP theory 
can be extended to a smooth one, but over a larger set of parameters). However, 
we have the following characterization from [38] . 
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Fact 2.16. Let T he NIP. Then the following are equivalent: 

(1) T is distal. 

(2) For any model M of T, any generically stable measure on M” is smooth. 

Remark 2.17. Let fj, he a Keisler measure on M" and A C M” a definable subset 
with n{A) > 0. Then we can localize ^ to A by defining fiA { X ) = /i(A fl X)/fj,{A). 
Clearly fiA is a Keisler measure on M" and fXA is generically stable (smooth) 
provided /r is. 

Let Aihe a structure, fii a Keisler measure on and fj ,2 a Keisler measure on 
M”. A Keisler measure /i on is called a product measure of fj,i and fi 2 if for 

any definable subsets X C M™, Y C M” we have fi{X xY) = fj,i {X)fj, 2 {Y). We can 
extend this notion to finitely many Keisler measure iii on in an obvious way. 

A product Keisler measure always exists but in general is not unique. However, for 
smooth measures we have the following proposition that follows from [221 Corollary 
2.5]. 

Proposition 2.18. Let A4 be a structure, ni a smooth Keisler measure on M'^ 
and fi 2 H smooth Keisler measure on M". Then there is a unique product measure 
of fj .1 and fj ,2 o-nd this measure is also smooth. 

In the case of smooth measures pLi and fj .2 we will denote their unique product 
measure as p, = pi® yL 2 . 

Let xi,... ,Xn be pairwise disjoint tuples of variables, and p a Keisler measure 
on X • • • X . Then for each i = 1,... ,n, p induces a Keisler measure pi 

on by 

p,{Y) = X • • • X X F X X • • • X 

and we will denote this pi by p\xi- It is easy to see that if p is generically stable 
(smooth) then every p]^^ is also generically stable (smooth). 

Also in this case we will call a Keisler measure p on x • • • x a product 

measure if /x is a product of p\xi, ■ ■ ■ 

Finally, we give some examples of smooth Keisler measures. 

Fact 2.19. (1) Any Keisler measure concentrated on a finite set (as Fact \2.lW 2) 
is clearly satisfied). 

(2) Let A„ be the Lehesgue measure on the unite cube [0, Ij” in R". Let M. be an o- 
minimal structure expanding the field of real numbers. If X f- R" is definable in 
j\4, then, by o-minimal cell decomposition, X n [0,1]" is Lehesgue measurable, 
hence A„ induces a Keisler measure on M". This measure is smooth by [22l 
Section 6]. 

(3) Similarly to (2), for every prime p a (normalized) Haar measure on a compact 
ball in Qp induces a smooth Keisler measure on Q]) (see |22l Section 6]). 

(4) Any definable, definably compact group G in an o-minimal structure or over the 
p-adics admits a unique G-invariant generically stable measure muz], which 
is then smooth by distality and Fact \2.lR 

3. Strong Erdos-Hajnal for definable bi-partite graphs in distal 

THEORIES 

In this section we prove the key result of this paper. 
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Theorem 3.1. Let Ai be a model of a distal theory, and R C M" x a definable 
relation. Then there is a constant S = S{R) > 0 and a pair of formulas 
'f’ 2 iy-,Z 2 ) such that for any generically stable measures on M" and re¬ 
spectively, there are Ci,C 2 from A4 with ci)) > 6, /i 2 (V' 2 (-W, C 2 )) > S, and 

the pair A = ci), B = ip 2 {M, C 2 ) is R-homogeneous. 

As in [ 2 ] the above theorem will follow from an asymmetric version (see Theorem 
13.61 below). 

Let be a distal structure and we fix a formula 4>{x, y). Let ipix, yi,... ,yi) be 
as given for (j){x, y) by Fact 12.51 

For d = {di,... ,di) we will denote by the subset of defined by '0(x, d) 
and call it a chamber. If in addition d G then we say that Cj* is a B-definable 
chamber. 

Definition 3.2. For a chamber C = and b € we say that (j){x, b) crosses 
C if both C n (^(M, b) and C fl b) are nonempty. 

For a chamber C and a set B we will denote by (B) the set of all 5 S S such 
that (j){x, b) crosses C. Note that C"^{M) is a definable set (by a formula depending 
just on the formula defining C). 

Definition 3.3. For B C a chamber C is called B-complete if C is B- 

definable and C*{B) = 0. 

It follows from the choice of tp that for every finite B C and a € there 
is a B-complete chamber C with a G C. In particular, for any finite B the union 
of all B-complete chambers covers 

Definition 3.4 (1/r-cutting). Adopting a definition from [31], we define 1/r- 
cutting as follows. 

Let v he a, Keisler measure on For a positive r S K we say that a family 

of chambers is a 1/r-cutting with respect to v if is covered by {C: C G R} 
and for every C G if we have v{C"^ {M)) < A 

The following claim is an analogue of a cutting lemma from m (see also Exercise 
10.3.4(b) there). 

Claim 3.5. There is a constant K such that the following holds. For any positive 
r and for any generically stable measure v on Afl*'! there is a finite set SQM such 
that the family of all S-complete chambers is a 1/r-cutting with respect to v, and 
the size of S is bounded by Kr^ log2r. 

Proof. Consider the family of sets 

C = {C'^(M): C is an M-definable chamber}. 

It is a definable family, hence has a bounded VC-dimension by NIP. Applying 
ProDOsition l2. 131 with e = 1/r we obtain a subset S' C M of size at most Kr^ log2r, 
where AT is a constant that depends only on the VC-dimension of C, such that for 
every M-definable chamber C if {M)) > 1/r then S fl C"^{M) A 0. 

Since C'^(S) = 0 for any S-complete cell C, we are done. □ 

The following theorem is an analogue of a result in [3J Section 6]. 
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Theorem 3.6. Let M be a distal structure and let R{x,y) be a definable relation. 
Then for any (3 £ (0, there are some a £ (0,1) and formulas zi), ip 2 {y, Z 2 ) 
depending just on R and fi such that: 

for any Keisler measure fi on and any generically stable measure v on 
there are some ci £ with ij,{ijji{x,ci)) > a, u(i/) 2 (y,C 2 )) > fi and 

the pair of sets '0i(M, ci), i/' 2 (Af, C 2 ) is R-homogeneous. 

Proof. Let 4>{x, y) be a formula defining R. Let r be a positive real number that 
we will determine later. 

By Claim 13751 let S C be a set of size at most Kr^\og2r such that for 
every S'-complete chamber C, < i. It is not hard to see that there is a 

constant Ki and a number I = Z(i/;) £ N such the number of S'-definable chambers is 
at most Ki\S\f Thus the number of S'-complete chambers is at most K'r^’’ log^ 2r, 
where K' is a constant. 

As the set of S-complete chambers covers there is an S-complete chamber 
Co with y{Co) > 

For the set D = \ Cf{M), we have v{D) > (1 — y) and for every d £ D, 

(p{x,d) does not cross Cq. In particular all a £ Co have the same ^-type over D. 
Note that is a disjoint union oi Di = {d £ D : Co ff d)} and D 2 = {d £ 
D : Cor\ (^{M, d) = 0}, and both (Co, Di) and (Co, D 2 ) are i?-homogeneous. Thus 
either v{Di) > ^ ^ or ^(£> 2 ) > 5 — Let ifi := ■)/> be the formula such that an 

instance of it defines Co, and let 1/12 be the formula such that an instance of it defines 
either Di or D 2 , depending on which one has large measure. By assumption there 
are only finitely many choices for both depending on the original data. So given 

fi £ ( 0 , i) we can find r with = /3, and take any positive a < - -. -. 

r log ( 2 r) 

Then, encoding finitely many choices for tpi , i />2 into one formula we can conclude 
the theorem. □ 

Proof of theorem HOI We can take any fi £ (0, i) and let a be as in Theorem 13.61 
Now take S = min{a, /?}. □ 

Remark 3.7. We will see in Corollary 14.61 that one can allow an extra parameter in 
R without affecting the uniform choice of ^i,' 02 - 

4. Density version and a generalization to hypergraphs 

First we prove that Theorem 13.11 can be strengthened to a density version. It 
seems that this implication is folklore, as it is mentioned in Corollary 7.1 and 
the remark afterwards] without definability of the homogeneous subsets and stated 
in m- However, the proofs in both places are very sketchy, so we give a complete 
proof verifying definability of homogeneous subsets, and in addition working with 
Keisler measures. Our argument is an elaboration on the proof of Theorem 3.3 in 

m- 

Proposition 4.1. Let M be a distal structure and R{x,y) a definable relation. 
Given a > 0 there is e > 0 such that for any Keisler measure fi on any 

generically stable measure v on and a product measure uj of p, and v, if 

uj{R{x,y)) > a then there are uniformly definable (in terms of a and R only) 
Ao C and Bo C M^y^ with p{Aq) > e, v{Bo) > e, and Ao x Bq C R. 
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We fix a distal structure A4 and a definable relation R{x,y). By Theorem 13.11 
we know that there is a constant (5 > 0 and formulas ipiix, zi),'ip 2 {y, Z 2 ) such that 
for any measure y on and any generically stable measure v on there are 
some A C and B C definable by an instance of 'ipi and ■02 respectively, 
with ii{A) > S and v{B) > <5, such that either AxBCR or AxBr\R = ^. 

Now we fix Keisler measures y,, v as in the proposition and let a; be a product 
Keisler measure of /i, v on 

For definable sets A C i? C we denote by d{A, B) the density of R in 
Ax B, namely 


d{A,B) = 


uj{{A X B) ni?) 


y.{A)v{B) 

and setting d{A,B) = 0 if y{A)v{B) = 0. The following claim is a basic step. 


Claim 4.2. Tet A C B C be definable sets with d{A, B) > 1 —. Then 

there are subsets Ai Q A, Bi Q B defined uniformly (in terms of A, B and R) 
such that fJ-{Ai) > Sfi{A), v{Bi) > 5v{B) and x Bi C R. 


Proof. Using Remark |2. 171 we apply Theorem 13.II to yLA,VB — the localizations of 
y OT\ A and v on B, respectively. This gives us A' C B' C defined by 

instances of 0i and 02 respectively, such that for the sets Ai = A' r\ A, Bi = B'dB 
we have n{Ai) > 6yi{A), v{Bi) > 5v(B) and either x Bi C B or {Ai x Bi)C\R = 

0 . 

If (Ai X Bi) n i? = 0 then 

u{{A X B) n B) < uj{A X B) - oj{Ai x Bi) < (1 - 5‘^)yL{A)v{B), 
contradicting the assumption d{A,B) >1 — 6^. □ 


It is not hard to see that Proposition 14.11 follows from Claim 14.21 and the fol¬ 
lowing claim by iterating sufficiently (but boundedly) many times and taking the 
conjunction of the corresponding defining formulas. 

Claim 4.3. For any 0 < a < 1 — there is some h > 0 such that for any 
definable A and B with d{A, B) > a there are uniformly definable (in terms of 
R, a, A, B) subsets A’ C A B' C B with yi[A') > hyi[A), v[B') > hv{B) and 

d{A',B')>a^^. 

Proof. We pick d G (0,1) to be determined later. 

We choose Aq C A, Bq C B homogeneous with respect to B and with ^{Aq) > 
5yL{A) and i^(Bo) > 5v{B) (applying Theorem 13.61 and Remark [2.171 to — 

the localizations of p. on A and v on B, respectively). 

If Aq X Bo C B then we take A' = Aq, B' = Bq, h = 6 and we are done. So 
assume 


(4.1) (AoxBo)nB = 0. 

Let oo = and bo = —Let a' = d(A,B), so a' > a. 

y{A) v{B) 

From (j4.1|l it follows that aobo < 1 — a' < 1 — a. In particular at least one of oq 
or bo is at most 1 /I — a. 

We assume oq < ^/l — a. 


Let Ai = 7l \ Aq and Bi = B \ Bq. 
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Case 1: v[Bq) < diy{B). 

For definable A! A and B' C B we write lu{R{A', B')) for uj{[A' x B') fl R). 
Since there are no i?-edges between Aq and Bq we have 

(4.2) Si)) + w(i?(Ai, Bo)) + a;((Ai, Bi)) = u:{R{A, B)) > afi{A)iy{B) 
We also have 

(4.3) ^(^o)u(Bi) + fj.{Ai)iy{Bo) + ^(^i)u(Bi) 

= di^)v{B) - fi{Ao)iy{Bo) < /r(A)u(B)(l - S^) 

A very simple combinatorial statement is that if ri+r 2 +r 3 > r and S 1 +S 2 +S 3 < s 

then there is z S {1, 2, 3} with — > -. 

Si s 

So in this case we can choose A' G {Aq, Ai} and B' G {Bq, Bi} such that 

d{A',B')>a^^, 

and fJ-{A') > hfj.{A), u(B') > hv(B), where 

h = min{(5, (1 — d),l — \/l — a}. 

Case 2: v[Bq) > dv{B). 

In this case we will take A' = Ai = A \ Aq and B' = B. As above, let Bi — B\Bo. 
Let d' = 1 — d. 

The maximal possible measure of the set of B-edges between Aq and B is 
w(i?(Ao, B)) = u}{R{Ao, Bi)) < ^(Ao)u(Bi) < \/l — ad'^{A)v{B). 

Thus for the measure of the set of B-edges between Ai and B we obtain 
a;(B(Ai,B)) > afj,{A)i^{B) — y/T^^ad'^{A)v{B). 

Since ^(Ai)u(B) < (1 — 5)^{A)v(B) we obtain 

, , uj{R{Ai,B)) a-y/l-ad' I - y/1- a{d'/a) 

= lUJiRW - —TVl— = “-rvj-■ 

As d' decreases to 0+, the right side of the above inequality goes increasingly to 

a --. Since 0 < <5 < 1 we have that --r <--. So we can choose d G (0,1) 

1 — 0 1 — 0^1 — 0 

so that for d' = 1 — d the right side is at least a - 

1 — 0 ^ 

Combining the two cases together we take 

h = min{(5, (1 — d), 1 — y/1 — a}. 

Uniform definability of A', B' in all the cases follows from the uniform definability 
of Ag, Bq and construction, so as always we can encode finitely many formulas into 
a single one. □ 

Now we can use this proposition inductively to prove the analogue of Proposi¬ 
tion |4A] for hypergraphs, essentially following the proof of [Ml Theorem 8.2]. 
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Proposition 4.4. Let M be a distal structure and R{xo,... ,Xh-i) a definable 
relation. Given a > 0 there is e > Q such that: given a generically stable product 
measure uj on TVfl^ol X X • • • X with uj{R) > a there are definable sets 

with u}\xi{Ai) > e for all i < h such that Yli<h Moreover, each 

Ai is defined by an instance of a formula that depends only on R and a. 

Proof. Let h > 2 he given, and assume inductively that we have proved the propo¬ 
sition for all i < h. Let R{xq, ... ,Xh) and a > 0 be given. Let a; be a generically 
stable product measure on x • • • x Applying ProDOsition Id.ll with h = 2 

to the binary relation R{xq]Xi, ..., Xh) we find some e' > 0, Aq with a;|a;o(Ao) > e' 
and A C x • • • x with w|xi....,a:h(A) > e' such that R holds on all ele¬ 

ments of Aq X A (the corresponding projections of ui are clearly generically stable). 
Moreover, A = R 'x ••• x for some uniformly definable (depending 

only on R and a) relation R'. We apply the inductive assumption to R' with h—1, 
a = s' and which gives us some e" > 0 and uniformly definable sets 

Ai,l < i < h with uj\xi{Ai) > e" and such that Ai x • • • x Ah C R', which implies 
Aq X Ai • • • X Ah C R. Take e = min{e',e"}. All the data is chosen uniformly 
depending only on R, a. □ 

The density version implies a generalization of Theorem l3.6l for hypergraphs. 

Corollary 4.5. Let A4 be a distal structure and R{xq, ■. ■ ,Xh-i) a definable rela¬ 
tion. Then there is 6 > 0 such that for any generically stable measures pLi on 
there are Ai with pifiAf) > 5 for all i < h, uniformly definable in terms of R, and 
such that either Y[i<h Ai C R or ni<?i A^ fl i? = 0. 

Proof. Since a product of generically stable measures is generically stable, the mea¬ 
sure a; = /xq 0 • • • 0 Ph-i is generically stable, and either uj{R) > ^ or oj{-'R) > i. 
Applying Propositionwith a = ^ to i? and to -•R we obtain some £i, £2 respec¬ 
tively. But then S = min{£i,e 2 } satisfies the conclusion. □ 

Besides, the formulas defining homogeneous subsets can be chosen depending 
just on the formula defining the edge relation, and not on the parameters used 
(in the semialgebraic setting this corresponds to saying that the complexity of the 
homogeneous subsets is bounded in terms of the complexity of the edge relation, 
and does not depend on the choice of the coefficients of the polynomials involved). 

Corollary 4.6. Let A4 be a distal structure and 4 >(xq, ... ,Xh-i,y) a formula. 
Given a > 0 there is e > 0 such that: for a definable relation R{xq, ... ,Xh-i) = 
4 >{xq, ... ,Xh-i,c) with some c € and a generically stable product measure w 

on X X • • • X with uj{R) > a there are definable sets Ai C 

with u}\xi{Ai) > £ for all i < h and ni</i Ai C R. Moreover, each Ai is defined by 
an instance of a formula that depends only on </> and a. 

Proof. Follows immediately by Proposition 14.41 applied to the relation 

R'{xo,...,Xh-i,y) = (fixo,... ,xh-i,y) 

and to the generically stable product measure uj' = u® 5c, where 5c is a (generically 
stable) {0, l}-valued measure on concentrated on c. □ 

Example 4.7. Let An be the Lebesgue measure on M" restricted to the unit cube, 
i.e. A„(A) = A„(A n/„) where A„ is the standard Lebesgue measure and /„ is the 
unit cube in K". 
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Let TZ be an o-minimal expansion of R and R{xi ,..., x™; m) be a formula. Then 
for any a > 0 there is some c > 0 such that for any c € with A„(i?(R"; c)) > a 
there are definable Ai C R, i = 1,... ,n with Ai(Ai) > e and Ali x • • • x An C 


This follows from Corollary 14.61 and Fact 12.191 

Also we get a generalization of the original semialgebraic counting version over 
finite sets from Theorem O with additional control on the parameters over which 
the homogeneous subsets are defined. 

Corollary 4.8. Let A4 be a distal structure and let a formula (j){x,y,z) be given. 
Then there is some 5 = 5{4)) > 0 and formulas 'ipi{x,zi) and 'ip 2 {y,z: 2 ) depending 
just on (j) and satisfying the following. For any definable relation R{x, y) = 4>{x, y, c) 
for some c G and finite A C B C there are some A' C A^ B' F B 

with \A'\ > (5|A|, \B'\ > d\B\ and 

(1) the pair A\ B' is R-homogeneous, 

(2) there are some ci € A^^^^ and C 2 G such that A' = tjji{A,ci) and B' = 

1/^2 (B,C2). 

Proof. Let ifi{x, Zi),'if 2 {x, Z 2 ),e be as given by Corollary 14.61 Then the existence 
of A',B' follows by defining y{X) (u(X)) to be the normalized number of points 
in X n A (resp., X (1 B). Such Keisler measures are always generically stable by 
Fact 12.1^ 

For Part (2), by Remark [2771 we can find some formulas ip[{x,z[) and ^ 2 ( 2 ^) 
such that for any finite sets A,B and ci,C 2 there are some c'^ G AI^iI,C 2 G 
such that Cl) = ip[{A,c'i) and 'fi 2 {B,C 2 ) = il’ 2 {B,C 2 ). □ 

We will show in Section [6] that the most basic version of Corollary |T8] character¬ 
izes distality. This is not the case however if we do not require definability of the 
homogeneous subsets. 

Remark 4.9. (1) If every definable relation in A4 satisfies the strong Erdos-Hajnal 
Property and Af is interpretable in M, then every definable relation in Af 
satisfies the strong Erdos-Hajnal Property. 

(2) Let A4 and Af be two structures in the same language and assume that Af 
embeds into A4 . If all quantifier-free definable relations in A4 satisfy the strong 
Erdos-Hajnal Property, then all quantifier-free definable relations in Af satisfy 
the strong Erdos-Hajnal Property as well. 

By the remark and Corollary 14.81 we have the following. 

Corollary 4.10. If AA is distal and Af is interpretable in AA (embeds into AA), 
then all definable (resp., guantifier-free definable) relations in Af satisfy the strong 
Erdos-Hajnal Property. 

Example 4.11. The following relations satisfy the strong Erdos-Hajnal Property. 

(1) Definable relations in an arbitrary algebraically closed field of characteristic 0 
(since ACFq is interpretable in the distal theory of real closed fields RCF). 

(2) Definable (in the language Ldiv) relations in an arbitrary non-trivially valued 
algebraically closed field of residue characteristic 0 (since its theory ACVEo,o 
is interpretable in the theory of real closed valued fields RCVF (see e.g. [H 
Corollary 6.3]), which is distal in view of Examr)le l2.10l) . 
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(3) Quantifier-free definable relations in an arbitrary field of characteristic 0 (as it 
can be embedded into some model of ACFq). 

(4) Quantifier-free definable (in Ldiv) relations in an arbitrary valued field of equichar- 
acteristic 0 (as it can always be embedded into a model of ACVFo,o)- 

Thus (see Remark 11.41) we obtain many new families of graphs satisfying the 
Erdos-Hajnal conjecture. 

Remark 4.12. (1) Every relation satisfying the strong Erdos-Hajnal property is 
NIP. 

(2) If all definable relations on A4 satisfy the Erdos-Hajnal property then A4 is 
NIP. 

Proof. (1) If the relation R{x,y) is not NIP, then for any finite bi-partite graph G 
there are some A C C such that G is isomorphic to (A, B, Rr\{Ax B)). 

By the optimality of the bound 0(log n) on the size of homogeneous subsets in 
arbitrary bi-partite graphs it follows that R does not have the strong Erdos-Hajnal 
property. 

(2) If the relation R{x,y) is not NIP, let R' C x be defined by 

R'{ab,cd) R{a,d) V R{c,h). This is a symmetric relation such that for any 

finite graph G there is some set A C such that G is isomorphic to (i?', A) 

(see e.g. [26l Lemma 2.2]). Again optimality of the logarithmic bound for arbitrary 
graphs implies that R' does not have the Erdos-Hajnal property. □ 


5. Regularity lemma for distal hypergraphs 


5.1. Regularity lemmas for restricted families of graphs. Szemeredi’s reg¬ 
ularity lemma is a fundamental result in graph combinatorics with many versions 
and applications in extremal combinatorics, number theory and computer science 
(see [24] for a survey). In it’s simplest form for bi-partite graphs, it can be presented 
as following. 


Fact 5.1. If s > 0, then there exists K = K (e) such that: for any finite bi-partite 
graph R Q A x B, there exist partitions A = Ai U ... U A^^ and B = BiU .. .U Bk^ 
into non-empty sets, and a set S C {l,...,fci} x {l,...,fc 2 } with the following 
properties. 


(1) Bounded size of the partition: ki,k 2 < K 


(2) Few exceptions: 




> (l-e)|AxH| 


(3) s-regularity: for all (i,j) G E, and all A' C Ai,B' C Bj, one has 


\\Rn{A'xB')\-d,,\A'\\B'\\<e\A\\B\, 


where dij = 


\Rn{AiXBi)\ 

\M\BA 


In general the bound on the size of the partition K is known to grow as an 
exponential tower of height i, and the result is less informative in the case of 
sparse graphs. Recently several improved regularity lemmas were obtained in the 
context of definable sets in certain structures or in restricted families of structures. 


(1) [20] Algebraic graphs of bounded complexity in large finite fields (equivalently, 
definable graphs in pseudofinite fields): pieces of the partition are algebraic 
of bounded complexity, no exceptional pairs, stronger regularity. Some gen¬ 
eralizations and simplifications were obtained in [31 dSI and by Hrushovski 
(unpublished). 
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(2) [27] Graphs of bounded VC-dimension: density arbitrarily close to 0 or 1, the 
size of the partition is bounded by a polynomial in (i). 

(a) [23] Graphs without arbitrarily large half-graphs, corresponding to the 
case of stable graphs (no exceptional pairs). 

(b) [TU [18] Semialgebraic graphs of bounded complexity. 

We remark that the classes of structures in (1), 2(a) and 2(b) are orthogonal to 
each other. In the next section we give a generalization of the case 2(b) to graphs 
definable in arbitrary distal structures. As remarked before, the stable and the 
distal cases present two extremal cases of general NIP structures. 

5.2. Distal regularity lemma. We work in a model At of a distal theory T. We 
have sorts Si,... ,Sk (i.e. definable subsets of some powers of At) and a definable 
relation i? C x • • • x Sk- 

Notation 5.2. (a) Let S = Si x ■■■ x Sk- 

(b) We call a subset X C S a rectangular subset if it is of the form X = XiX - ■ ■ Xk- 

(c) For ACM and a finite set of formulas A = {Ai{xi,yi),i = a 

rectangular subset X = Xi x ■ ■ ■ Xk is called A-definable over A if each Xi is 
a finite Boolean combination of sets from {Ai{xi,a): a G A}. (In fact we will 
need only conjunctions of A^ and their negations, i.e. partial A-types.) 

(d) Given Keisler measures on each sort Si, for a rectangular definable X = 
Xi X ■ ■ ■ X Xk we set 

/i(A) = fj . i { Xi ) ■ y , 2 { X 2 ) • ... • fik { Xk ). 

(e) By a rectangular definable partition of S we mean a finite partition V oi S 
consisting of rectangular definable sets. 

(f) For rectangular definable partitions V, Vi of S we write V C Vi V refines 
Vi, namely for each X G V there is F S Pi with X CY. 

(g) Given Keisler measures on each sort Si, for a rectangular definable partition 
V of S, we define the defect of V to be 

def(P) := ^ y{X). 

X^V 

X is not i?-homogeneous 

Obviously, if Pi C P then def(Pi) < def(P). 

Proposition 5.3. There is some constant c = c{R) such that: for any e > 0 and 
any generically stable measures p-i on Si, for i = 1 ,.. .,k, there is a rectangular 
uniformly definable (in terms of R and e) partition V of S with |P| < (i) and 
def(P) < e. 

We give a proof of the proposition in several claims, essentially following the 
proof of [HI Theorem 1.3] but working with Keisler measures. 

Using Proposition 14.41 we know that the following holds. 

Claim 5.4. There is a constant S = S{R), and formulas Afixi, yi), i = 1,... ,k such 
that for any generically stable measures pi on Si there are ai,i = 1,... ,k, such that 
the sets Xi C Si defined by Ai{xi,ai) are R-homogeneous and pfiXfi > 5. 

Remark 5.5. Since we are going to keep track of the parameters used in A^ it 
is more convenient to assume that in the above claim yi = ■ ■ ■ = yk = y and 
ai = ■ ■ ■ = Ok = a. It can be always achieved by a concatenation of variables. 
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We fix S from the previous claim and let A = {Ai{xi,y), i = ..., fc}, where Ai 
are from the above claim. 

Claim 5.6. Let X be a definable rectangular subset of S with fJ.{X) > 0. Then 
there is some a € and a rectangular set Y which is A-definable over {a}, such 
that X (lY is R-homogeneous and ii{X fl y) > 6^ii{X). 

Proof. Apply Claim [5^ to measures y,i relativized to the sets Xi. □ 

Claim 5.7. LetV be a rectangular partition of S which is A-definable over a finite 
set A. Then there is a rectangular partition Vi which is A-definable over a finite 
set Ai with 

(1) \Pi\<{k + l)\V\, 

(2) lAil < |A| + |P|, 

(3) def(iPi) < (l-<5'=)def(iP). 

Proof. Let A G 7^ be non-homogeneous. We can partition it into (fc-l-1) rectangular 
subsets, all of them A-definable using one extra parameter ax, such that one of 
these subsets is i?-homogeneous and is of measure at least 5^p,{X). Namely, if 
X = n^=i Ai, by Claim [5Jl there are some Yi C Xi such that Y = 0^=1 Ai is an 
i?-homogeneous subset of X with p{Y) > 5^p.{X), and we take a partition of X into 
(fc -I-1) pieces given by the sets YiX .. .Yk and Ai... x Ai_i x (Ai \Yi)xYiX .. .xYk 
for alH = 1 ,..., fc. 

Replacing each non-homogeneous X £V with such a sub-partition we obtain Vi 
satisfying the requirements. □ 

Thus, by induction on n we can construct a rectangular partition of S which 
is A-definable over a finite set A„ and such that 
(1) def(iP„) < (1 - 5'=)", 
m\Vn\<{k + ir, 

(3) |A„| < + IV = 

log s 

In particular, given £ > 0, using (1) and (2) above, after N = -—;-^ = 

log (1 — 6^) 

(~ iog(iVgfe) ) log \ Steps we have def(T’Ar) < £ with \'Pn\ <{k + 1)^ < 2^^ < (i)‘=, 
where c = — ^ positive constants depending only on R. 

This finishes the proof of Proposition 15.31 

From the above fc-partite version we obtain a regularity lemma for hypergraphs. 

Theorem 5.8 (Distal regularity lemma). Let P C M'^ he a definable set and 
R{xi,... ,Xk) with \xi\ = d for all 1 < i < k be a definable relation. Then there is 
some constant c = c{R) such that the following holds. 

For any £ > 0 and for any generically stable measure p, on P, there is a partition 
P = Pi U ... U Pk with K = O ((-)'^) such that Pi’s are uniformly definable (in 
terms of R and s) and 

^k-{Pii) ■■■T{P^k) <£> 

where the sum is over all tuples {ii,...,ik) such that {Pi.^,... ,Pi^) is not R- 
homogeneous. 
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Proof. Let Vn, An, c, A be as given by the proof of Proposition [O] for Si = P and 
fj,i = fjL, for all 1 < i < fc. 

Using Fact 12.41 we obtain constants C 2 and C 3 depending only on R and such 
that the number of A-types over any hnite set A is bounded by C 2 |^|'^^. Finally, 
we partition P into realizations of complete A-types over An, say P = Uka 
follows that there will be at most C 2 (^)“^ parts. It is easy to see that this partition 
of P satisfies the homogeneity condition because the rectangular partition 

V := {Pii X •• • X 

refines Vn and def(PAr) < £• D 

5.3. Finding definable equipartitions. Normally in the conclusion of a regular¬ 
ity lemma one is able to choose parts of (approximately) equal measure. We give a 
sufficient condition for this in the definable setting. To simplify some expressions, 
given real numbers ri,r 2 and £ > 0 , we write ri r 2 to denote that |ri — r 2 | < £. 

Definition 5.9. We say that a structure Ai uniformly cuts finite sets if for every 
formula ^(x, y) and every £ > 0 there is a formula x(x, z) such that for any suffi¬ 
ciently large finite set A C any b € and any 0 < m < \(j){A, 6 )| there is 

snmp r C mirb that ™ 

some c fc AT sucn tnat \<i>{A,b)\ ~ U(a,&)| ■ 

Note that this is a property of Th(A4). 

Example 5.10. (1) Assume that there is a definable linear order x < y on M. 
Then clearly A4 uniformly cuts finite sets and x in Definition 15.91 can be chosen 
independently of </> and £ (using lexicographic ordering for subsets of M" for 
n > 1 ). 

(2) Let M = (Qp, -f, x), then M uniformly cuts finite sets. For subsets of M this 
follows from the fact that every ball in Qp is a disjoint union of exactly p balls, 
using which an argument similar to the proof that for an atomless measure, 
every set of positive measure contains subsets of arbitrary smaller measure, 
can be carried out up to e, in a number of steps bounded in terms of e (one 
can check using quantifier elimination in the p-adics that in this case x cannot 
be chosen independently of e). To extend this to subsets of M” for n > 1, note 
that if k is an infinite field and A C fc” is a finite set then there is a uniformly 
definable linear map /: fc" —>■ fc that is one-to-one on A. 

Proposition 5.11. Let Ai he a distal structure and assume that it uniformly cuts 
finite sets. Then for every formula 4>{x, y) and i5 > 0 there is some x(x, z) such 
that: for any generically stable measure y on Ai with /r({c}) = 0 for any singleton 
c € if 0 < a < ii[(j){x,a)) then we can find some b € with ii{(j){x,a) fl 

X{x,h)) a. 

Proof. Fix £ > 0 arbitrary, and let xix, z) be an arbitrary formula. As A4 is 
distal, it follows bv Fact 12.161 that u is smooth over AI. Let ^l(x),df(x),i/:i(y,z), 
i = 1,... ,n list all of the formulas over AI given by Fact 12.151 for each of (j){x, y) 
and (j){x, y) A x(x, z), with respect to y and e. Let B be the hnite Boolean algebra 
of subsets of generated by 0 = {0-(M) : z = 1,..., n, t = 1, 2}. Clearly the 
number of atoms in B is at most 4". By assumption every dehnable set of positive 
p-measure is inhnite. Then for all sufficiently large m G N we can choose a set 
C C All'll, \C\ = m such that for every atom A of B, \ — p(A)| < It then 
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follows that for every i G n}, t G {1, 2} we have ^(0-(M)). But 

by the choice of 6^, 9} this implies that for any set D from A(M) = a) : a G 

M} U a) n x(M, b) : a,b G M} we have m(-D)- 

Now let 0 < Of < ^ := fi{(j){M,a)) be given (if a G {0,/3} then there is nothing 
to do). Let e := and let x(x, z) be as given by Definition 15.91 for (j){x,y) and e. 
Take m sufficiently large (to be specified later), then for C with \C\ = m chosen as 
above with respect to e and x we have in particular Ri® /3. Let I := a)|. 

We may assume that there is some k G N,k < I such that j ^ (as a > 0, by 
choosing m sufficiently large we may assume that I is arbitrarily large). Then, using 
that both /3, 4 < 1 we obtain a Ri® /3y —y = —, so a —. 

By the choice of x{x,z), there is some b G such that 
which implies m’ \‘l>iC,a)nx{C,b)\ ^3e gy assumption 

on C this implies that n{(j){x, a) A xi^, b)) r:'^'^ a, i.e. n{(t>{x, a) A x{x, b)) r:'^ a. 

□ 

Remark 5.12. Recall that a global measure fj, is definable over a small model M if it 
is Aitt(U/M)-invariant and for every formula (j){x, y) G L and every closed subset X 
of [0,1], the set {q G S\y\{M) : yL{(j){x,b)) G X for any b G ^ <l{y)} is closed. 

It is finitely satisfiable if for every (p{x, b) G L(U) with yi{(f{x, b)) > 0 there is some 
a G such that ^ ff(a, b) holds. As mentioned before, in an NIP structure, 
a Keisler measure y, over M is generically stable if and only if it admits a global 
M-invariant extension which is both definable over M and finitely satisfiable in M 
(see [551 Theorem 3.2]). 

Corollary 5.13. A ssume that T uniformly cuts finite sets in such a way that x in 
Definition \5.9\ can be chosen independently of e (e.g. if A4 has a definable linear 
order). Then under the assumptions of Proposition \5.11\ we can choose 6 G U 
such that yi[4>(x,a) r\xix,b)) = a, where yi is the unique global Keisler measure 
extending y. 

Proof. As yi is generically stable over A4, it is in particular definable over M. 
That is, for every ^ > 0 the set {& G U : a — 5 < yi{(j){x,a) A x{x,b)) < a -I- 
is type-definable over A4 (and consistent). It then follows by compactness that we 
can find some b* gU with yi{(j)(x, a) A ipix, b)) = a. □ 

Corollary 5.14. Let M be a distal structure and assume that it uniformly cuts 
finite sets. Then in Theorem \5.8\ for any y satisfying in addition /^({c}) = 0 for all 
c G M‘^ and any S > 0 we can find a partition Pi,..., Pk with y{Pi) y{Pj) fox 
all 1 < i, j < K (and the parts Pi are uniformly definable in terms of R,e,S,x)- 

Proof. We are following the standard repartition argument (see e.g. [181 Proof of 
Theorem 1.3]). 

Let (P, R) be a fc-uniform hypergraph, and let P = Pi U ... U Pk be a partition 
of its vertices given by Theorem 15.81 for |, with K < ci (-) Fix (5 > 0 and y 
satisfying the assumptions, and we will find a new partition P = Qi U ... U Qk' 
satisfying the conclusion of the corollary for e and 5. 

Let K' = [l^AT], without loss of generality 0 < (5 < and fix an arbitrary 
0 < <5' < .^. Using Proposition [531] we can partition each Pi into Pi = 5^ U |J Qiy 
with y{Qij) for all j and the remainder y{Si) < Let now S = Ui ^cid 
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again using Proposition l5.11l we can partition S into S = TUlJ Uj with fi{Uj) 
and the remainder ii{T) < ^- As d' was sufficiently small compared to <5 and 
calculating the error we get /r(T) We claim that P = IJ UjUT is the 

required partition, re-enumerate it as P = Qi U... U Qk' ■ We still have that K' is a 
polynomial in i. Note that < ^, so there are at most K parts of the new par¬ 
tition contained in S. Hence the sum ^ /i(Qii) ■ • ■ over all tuples {ii,..., ik) 

for which not all of P^j,..., P^ are subsets of parts of the original partition is at 
most -I- 5)^ < K Jk^ ~ ~ f' Together with the as¬ 
sumption on the original partition it then follows that ■ ■ -fJ-iQik)-: where 

the sum is over all tuples (H, ■ • ■, ik) such that {Qi^ , • ■ •, Qik) is not P-homogeneous, 
is bounded by § + | < £■ 

It follows from the construction that the new partition is uniformly definable in 
terms of the old one and S, and thus uniformly definable in terms of P, £ and <5. □ 

Remark 5.15. Answering a question from an earlier version of our article, Pierre 
Simon had recently demonstrated that all distal structures uniformly cut finite sets. 

6 . Equivalence to distality 

6.1. Strong Erdos-Hajnal fails in ACFp. Fields of positive characteristic give 
a standard example of the failure of the strong Szemeredi-Trotter bound on the 
number of incidences between points and lines. In a personal communication Ter¬ 
rence Tao had suggested that it may also be used for the failure of the strong 
Erdos-Hajnal property in this setting, which turned out to be the case indeed. 

Let F be a held. For a set of points P C and a set of lines L in F^ we denote 
by /(P, P) C P X P the incidence relation, i.e. I{P,L) = {{p,l) S P x P : p G /}. 
As remarked in Example 14.111 every held F of characteristic 0 satishes the strong 
Erdos-Hajnal property with respect to quantiher-free formulas. In particular we 
have: 

Proposition 6.1. Let ¥ be a field of characteristic 0. Then there is a constant 
d > 0 such that for any finite (sufficiently large) set of points P C F^ and any finite 
(sufficiently large) set of lines L in F^ there are some Pq C P and Pq C P with 
|Po| > S\P\, |Po| > d|P| and /(Po,Po) = 0- 

We show that the assumption of characteristic 0 cannot be removed. 

Proposition 6.2. We fix a prime p, and let F = F“^®. Then the conclusion of 
Provosition \6.1\ fails in F. 

Proof. Assume towards a contradiction that F satishes Proposition 16.II 

Since every hnite held of characteristic p can be embedded into F, we obtain that 
the following would be true: 

Let Fq be a hnite held of characteristic p, of size q. Let P be the set of all points 
in Fg, and let P be the set of all lines in F^ of the form y = ax + b. Then there are 
Pq F P, Lq F L with |Po| > i5|P|, |Po| > d|P| and such that I{Po,Lo) = 0. 

We show that this is impossible. We have |P| = |P| = g^. Since Fg has size 

g, every line contains exactly g points, therefore |/(P, P)| = |P|g = g^. Notice also 
that every point belongs to exactly g lines in P. 

We hx k large enough so that ^ < 6, and let Sq — Since q= p'^ for some n, 
Soq is an integer for every q > p^. 
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Hence we can choose Pq C P with |Po| = and Lq C L with \Lo\ = So\L\ 
such that I{Po, Lq) = 0. Let Pi = P \ Pq, Li = L \ Lq. We have |Pi| = (1 — 
and \Li \ = (1 - So)q‘^. 

Consider I{Pq,L). Since every point belongs to exactly q lines in L we have 
\I{Po,L)\ = \Po\q = Soq^. Since I{Po,Lo) = 0, we have I{Po,L) = I{Po,Li), so 
\IiPo,Li)\= 6 oq\ 

On the other hand, from the Cauchy-Schwartz inequality (see e.g. [35l Page 1]) 
it follows that |/(Po,-^i)| < Li)\ + |Pop- 

Thus we have 

Soq^ < \/(i - + S^q‘^ = ^(1 - (5o)5og® + (i - 

Since (1 —<5o) < 1, the above inequality fails for large enough q — a contradiction. 

□ 

Corollary 6.3. Let K he an infinite field definable in a distal strueture M.. Then 
char{K) = 0. 

Proof. By Corollary 4.5] every infinite NIP field of characteristic p > 0 con¬ 
tains But then Ai cannot satisfy the strong Erdds-Hajnal property by the 

proposition above, contradicting distality. □ 

In particular the theory ACFp admits no distal expansion. No examples of NIP 
theories with this property were known until now. 

6.2. Equivalence to distality. In this section we assume some familiarity with 
NIP theories (see e.g. [39]) and recall some facts about distal theories. We fix a 
theory T and a big sufficiently saturated model U of T. Recall that a sequence 
{oi : i G I) of elements of M" indexed by a linear order I is indiscernible over a 
set of parameters A Q M ii for any ii < ... < ik and ji < ... < jk from / we 
have tp(aij,..., at^/A) = tp(ajj,..., Oj^fA). Given a linear order I, by a Dedekind 
cut in I we mean a cut I = + I 2 (i.e., I = Ii U I 2 , Ii r\ I 2 =0 and a < 6 for all 

a G Ii,b G I 2 ) such that Ii has no maximal element and I 2 has no minimal element. 
We denote by I* the reverse of the order on I. 

Fact 6.4. [5] Let T be a complete NIP theory. Then the following are equivalent: 

(1) T is distal (in the sense of Fact lK^) . 

(2) Every indiscernible sequence I C M‘^ in any model of M of T is distal. That 
is, for any two distinct Dedekind cuts of I, if some two elements fill them 
separately, then they also fill them simultaneously: if I = Ii + I 2 + I 3 o-nd we 
have some a and b from M‘^ such that both Ii + a +12 +13 and Ii +12 + b +13 
are indiscernible, then + a + I 2 + b + I 3 is indiscernible. 

Given an indiscernible sequence I = (oi : i G [0,1]), one defines the average 
measure p of I as the global Keisler measure given by /i(0(x)) = Ai({j G [0,1] : 
Oi 1= (((x)}) for all definable sets (({x), where Ai is the Lebesgue measure on [0,1]. 
It follows from NIP that this Keisler measure is well-defined, i.e. that the corre¬ 
sponding set of indices is measurable for every (j){x) with parameters from U. We 
say that c = {Ii,l 2 , t) is a polarized cut of / if / = /i -I -/2 is a cut of I and t G {1, 2} 
specifies whether it is approached from the left or from the right. It follows from 
NIP that for a polarized Dedekind cut c = (Ii, / 2 , t) and a set of parameters H C U 
we have a complete limit type of c over A denoted by lim(c/H) and defined by 
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(f){x) S lim(c/A) the set {* S /* : U |= (/>(ai)} is unbounded from above in 

case t = 1, or from below in case t = 2. 

Fact 6.5. Let T he NIP, let I be an indiscernible sequence and let fj, be the average 
measure of I. 

(1) The measure n is generically stable f \22\ Proposition 3.7]). 

(2) The support of p, is exactly the set of limit types of cuts of I. That is, if 
for some formula 4 >{x) we have pi{ 4 >(x)) > 0 then 4 >{x) € lim(c/U) for some 
polarized Dedekind cut c of I [38l Lemma 2.20]. 

(3) I is distal if and only if p. is smooth [38l Proposition 2.21]. 

Recall that a sequence {oi : i € I) is totally indiscernible if for any ii ^ ... ^ ik 
and ji ^ ... ^ jk from I we have tp(aji ,...,04 /A) = tp(aji ,... ,aj^/A) 

Fact 6 . 6 . |38j Let T he dp-minimal. Then it is distal if and only if no infinite 
indiscernible sequence is totally indiscernible. 

Remark 6.7. It follows that if T' is a dp-minimal expansion of a distal dp-minimal 
theory T, then T' is distal as well. 

Indeed, If T' expands T and (oi : i G I) is an infinite L'-indiscernible sequence, 
then it is in particular L-indiscernible, so not totally-L-indiscernible by distality of 
T, so of course not totally-L'-indiscernible. 

Fact 6.8 (Strong base change. Lemma 2.8 in [38]). Let T be NIP. Let I be an 
indiscernible sequence and A A I a set of parameters. Let {ci : i < a) be a sequence 
of pairwise-distinct polarized Dedekind cuts in I. For each i, let di fill the cut q 
(i.e., if ti = (/i,/ 2 ,t) then Ii-\-di-\-l 2 is indiscernible). Then there exist [d'^:i < a) 
in U such that: 

(V tp{{d'i)i<o,/I) = tp{{di)i<a./I), 

(2) for each i < a, tp{d]/A) = lim(ci/24). 

Finally, we will use the following finitary version of a characteristic property of 
NIP theories. 

Fact 6.9. Let (j){x,y) he an NIP formula. Then there are some k,N gN such that 
for any indiscernible sequence I = {at : i < n) from with n > N and any 

b G the set 4>{I,h) is a disjoint union of at most k intervals. 

Proof. Follows from the usual characterization of NIP via bounded alternation on 
indiscernible sequences (see e.g. [TJ Proposition 4]) plus compactness. □ 

Theorem 6.10. Let T be an NIP theory. The following are equivalent: 

(1) T is distal. 

(2) For any definable relation R{x,y) and any global generically stable measures 
/ii,/r 2 there are some definable X C nil'll, F C which are R-homogeneous 
and satisfy p.i{X) > 0 ,/r 2 (F) > 0 . 

(3) For any definable relation R(x,y) there is some 5 > 0 and some formulas 

ijji{x, zi),ip 2 {x, Z 2 ) such that for all finite A C C there are some 

d G i = 1,2 such that {ifiiA, ci)| > <5|A|, \'ip 2 {B, C 2 )| > and the pair 
of sets 'ipi{A,Ci),ip 2 iB,C 2 ) is R-homogeneous. 

Proof. (1) implies (2) and (3) follow from Corollaries 14.51 and 14.81 
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(2) implies (1). Assume that I = (ai)igi is a non-distal indiscernible sequence, 
with T = [0,1]. This means that / can be written as / = Ii + I 2 + h (where 
Ij = {ai : i € Ij) and Ti,X|, 12,13 are without last elements) in such a way that 
there are some c, d £ U such that Ii +C+I 2 +I 3 and Ii +/2 + d +/3 are indiscernible, 
but Ii + c + I 2 + d + I 3 is not. 

Then there is a formula x, l 2 ,y, I 3 ) with some finite /' C Ij, say /' = 
{ui : i G 21'),X' C Ij for j G {1,2,3}, such that 0(/(,a^-,Ofc,/s) holds for any 
I[ < j < I 2 < k < X 3 , but 1= l 2 i d, I'^)■ Let [ji,j 2 ] be some interval of 

X between X{ and X 2 , and [ki, k 2 \ some interval between I 2 and X 3 . Let J = {at : 
i G [ji,j 2 ]),K = {tti : i G [ki,k 2 ]). Let be the average measure of J, and ly the 
average measure of K (we may assume that both sequences are indexed by [0,1] by 
taking an order preserving bijection). Then both fj, and v are generically stable by 
Fact l6.5f ll. 

Now assume that X = ,^(U) and Y = x(U) are definable subsets of with 
> 0 and v{Y) > 0 , where ^,x are formulas with parameters in some small 
model M ^ I. By Fact 16.51 21 it follows that there is some polarized Dedekind cut 
c of J such that ^{x) G limj(c/M), and some polarized Dedekind cut d of K such 
that x{x) G limif(c)/M). 

It follows by compactness, indiscernibility of / and taking an automorphism of 
U that there is some c' filling c and d' filling d (separately, as cuts in I) such 
that -!(/)(/{, c',/ 2 , d',/s) holds. By Fact 16.81 we can find some c",d” such that still 
-!(()(/{, c",/ 2 , d",/g) holds, but moreover c" \= lim(c/M),d" ^ lim(c)/M). In partic¬ 
ular, \= ^{c”) A x(d"). On the other hand, by the choice of (p and the definition of 
P-, V there are some j < fc in / such that ^ C(®t) x(®fe) ^ <('(2i, Uj, L 2 , Ofc, Ig). This 
shows that the relation R{x,y) = (/>(/{,a;,d>Lg) is not homogeneous on A x F. 
As X, Y were arbitrary definable sets of positive measure, we conclude. 

(3) implies (1). Assume that T is not distal, and we will show that (3) cannot 

hold. Working in U we have some X = (aj : j G Q) for j G (1, 2,3} and a, b such 
that I = Ii + I 2 + I 3 , Ii + a + I 2 + I 3 and Ii + I 2 + b + I 3 are indiscernible, but 
Ii + a + I 2 + b + I 3 is not. This implies in particular that there is a formula (f> G L 
such that \= J 2 , b, Jg) for some finite J- C X with J' = {uj : j G Jj), but 

1= (j){J[,a', J 2 , b', Jg) for any a', b' G I such that J{ < a' < J 2 <b' < 

Let now R{x,y;c) := x, J 2 ,y, J 3 ) with c := J'lJ^Js- Assume that there 
are ilji{x,y),i G {1,2} and d > 0 as required by (3) for R. As T is NIP, it follows 
by Fact 16.91 that there are some k,N G uj such that for any indiscernible sequence 
K = {uj : j < n) with n > N and any di G U,z G {1,2}, the set tpi{K,di) is 
a disjoint union of at most k intervals. Without loss of generality it then follows 
from (3) that there is some k' G uj such that for any finite indiscernible sequences 
A = (ttj : j < n) and B = (bj : j < n) with n > N we can find intervals 
Ao C A, |Ao| > and Bq C B,|Bo| > such that (Aq.Bo) is R{x,y,c)- 
homogeneous. We are going to show that this property fails. 

Re-enumerating the sequence we may assume that X = I 13 + X^i + ... and 
/g = ... + /gp + /g^o, with each of Xj- indexed by Q, and that J( C X^o, J 3 GL Ig^o- 
Let X := h \ 2i,o, I 3 = h \ dg.o- 

By indiscernibility of I, automorphism and compactness for any Dedekind cuts 
c of X and c' of /g we can find some a' and b' which fill those cuts (separately, 
viewed as cuts in I) and such that |= , X, b', Jg) holds. 
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For each z S w, let (cij- : j € uj) he an infinite increasing sequence of cuts of 
/i.i, and let (c'^ : j € lu) he a decreasing sequence of cuts of Is^i. By the previous 
remark, let Qij and bij be such that aij fills the cut Cij, bij fills the cut c'^ and 
[= , Ui j, t/ 2 , , </ 3 ) holds. 

Next using Fact 16.81 and induction we can choose a^pb'^j such that: 



From this we have: 

(a) For any i,jGoj we have that \= -•(/>( J{, a'j-, J 2 , b[j,J^) holds. 

(b) The sequence I[ with all the {a'^ j : i,j G oj} added in the corresponding cuts 
is an indiscernible sequence, 

(c) The sequence /g with all the {b[j : i,j G uj} added in the corresponding cuts is 
an indiscernible sequence, 

(d) \= (j>{Ji,a', J 2 , b', Jg) holds for any a' G I{,b' G /g. 

Here (a) follows from the first bullet and the choice of aij,bij; using the second 
bullet above it is easy to show that (b) holds, and that the sequence has the same 
EM-type as I (similarly for (c)); (d) was already observed above. 

In view of (a)-(d) above, for any m G w we can choose indiscernible sequences 
A = (aj : j < 2k'm) and B = {bj : j < 2k'm) such that for any li,l 2 < 2/c' we have 
h im+/2 5 , c) &.Ild 1= , 6 / 2 m+j 2 5 3 -iiy Jl,j 2 € (2/u It 

then follows that for all sufficiently large m, for any choice of an interval ^ ^ 
with |Ho| > -^ > 2m and Bq C B with |i3o| > -^ > 2m, the sets (Hg, Hq) cannot 
be R{x, y; c)-homogeneous — a contradiction to the choice of k'. □ 

Remark 6.11. Pierre Simon has also observed a version of the implication (2) => 
(1) in Theorem 16.101 after seeing a preliminary version of our results. 

The above proof shows that an NIP theory is distal if and only if the property 
(3) in Theorem 16.101 holds for all finite indiscernible sequences A,B. As the follow¬ 
ing proposition shows, in an arbitrary NIP theory the property (3) almost holds 
for A, B indiscernible sequences, except for the uniform definability of one of the 
homogeneous subsets. 

Proposition 6.12. Let (j){x,y) be NIP. Then there is e > 0 depending only on 
(f> such that for any A = {ai : i < n) and B = {bi : i < m) indiscernible sequences 
(in fact A-indiscernible for some finite A depending just on (j) is enough) there are 
Aq <G A, Bq C B such that: |Ao| > e |A|, |ilo| > £ \B\ and either (j) (a, b) holds for 
all a G Ao,b G Bq or -<(j) (a, b) holds for all a G AQ,b G Bq. 

Proof. By Fact 16.91 there is k such that (f){x,y) can’t alternate on an indiscernible 
sequence more than k times. We divide B into k + 1 intervals of almost equal 
length. Namely, for i < A: -|-1 let 



Then for every a G A there is some interval Bi^ not containing any alternation 
points. It follows that for some i' < fc -|- 1, there are ^^-many points in A which 
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do not alternate inside Bi/, and then at least half of them satisfy (j) or So we 
can take e = 2(k+i) ■ ^ 
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